Minimal Cost Attribute Reduction through Backtracking
نویسندگان
چکیده
Test costs and misclassification costs are two most important types in cost-sensitive learning. In decision systems with both costs, there is a tradeoff between them while building a classifier. Generally, with more attributes selected and more information available, the test cost increases, and the misclassification cost decreases. We shall deliberately select an attribute subset such that the total cost is minimal. Existing decision tree approaches deal with this issue from a local perspective. They benefit from immediately available test results, therefore objects falling into different branches may experience different tests. In this paper, we consider the situation where tests have delayed results. Since we need to choose a test set for all objects, the attribute reduction problem is defined from a global perspective. We propose a backtrack algorithm with three pruning techniques to find a minimal cost reduct. Experimental results indicate that the pruning techniques are effective, and the algorithm is efficient on a medium sized dataset Mushroom.
منابع مشابه
Rules-based Classification with Limited Cost
In test cost-sensitive decision systems, it is difficulty for us to find an optimal attribute set and construct a quality classifier with limited cost. The minimal test cost-sensitive attribute reduction is proposed to address the former problem. However, it is inevitable to remove some good even better attributes in the minimal test cost-sensitive attribute reduction. As a result, the classifi...
متن کاملMinimal cost feature selection of data with normal distribution measurement errors
Minimal cost feature selection is devoted to obtain a trade-off between test costs and misclassification costs. This issue has been addressed recently on nominal data. In this paper, we consider numerical data with measurement errors and study minimal cost feature selection in this model. First, we build a data model with normal distribution measurement errors. Second, the neighborhood of each ...
متن کاملConstraint Sovling Engine based Nurse rostering with Intelligent Backtracking
Efficient utilization of time and effort is essential in Personnel scheduling problems to evenly balance the workload among the people and attempt to satisfy the personnel preferences. In Constraint Satisfaction Problem based scheduling problems, when a branch of the search fails the backtracking search algorithm back up to the preceding variable and try a different value for it. So here the mo...
متن کاملAnalysis of alternative objective functions for attribute reduction in complete decision tables
Attribute reduction and reducts are important notions in rough set theory that can preserve discriminatory properties to the highest possible extent similar to the entire set of attributes. In this paper, the relationships among 13 types of alternative objective functions for attribute reduction are systematically analyzed in complete decision tables. For inconsistent and consistent decision ta...
متن کاملTest-cost-sensitive attribute reduction
In many data mining and machine learning applications, there are two objectives in the task of classification; one is decreasing the test cost, the other is improving the classification accuracy. Most existing research work focuses on the latter, with attribute reduction serving as an optional pre-processing stage to remove redundant attributes. In this paper, we point out that when tests must ...
متن کامل